Claims 

What is claimed is: 

7. A method for correcting errors for consensus decoding of speech, the 

5 method comprising the steps of: 

creating a confusion set comprising a plurality of candidate words, each of 
said candidate words having an associated score; and 

selecting a candidate word from the confusion set, wherein a candidate 
word other than a candidate word having a highest associated score is selected when one 
10 or more criteria are met. 

2. The method of claim 1, wherein the step of selecting a candidate word 
further comprises the step of applying a rule to the confusion set, the rule determining 
which of the candidate words is selected. 

15 

3. The method of claim 1, further comprising the step of outputting the 
selected word. 

4. The method of claim 2, wherein the step of applying further comprises the 
20 steps of: 

applying the rule to the confusion set when a candidate word in the 
confusion set has a posterior probability that is not greater than a predetermined 
threshold, wherein the step of applying selects one of the candidate words in the 
confusion set; and 

25 selecting the candidate word having the posterior probability that is greater 

than a predetermined threshold when this candidate word has a posterior probability that 
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is greater than a predetermined threshold. 



5. The method of claim 1 , further comprising the steps of: 

determining a plurality of allowable transformations on a plurality of 

5 training confusion sets; 

providing an objective function; and 

learning a plurality of corrective rules for the training confusion sets, the 
step of learning using the allowable transformations and objective function. 



10 
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6. The method of claim 1, further comprising the step of learning when to 
select a candidate other than a candidate having a highest associated score. 

7. The method of claim 1, wherein said associated score is a posterior 
probability. 

8. The method of claim 1, wherein the step of selecting a candidate word 
further comprises the step of employing a data classifier. 



9. The method of claim 8, wherein the data classifier is a decision-tree. 

20 

10. A method for determining a plurality of corrective rules from a plurality of 
training confusion sets, the method comprising the steps of: 

determining a plurality of allowable transformations on the training 

confusion sets; 
25 providing an objective function; and 

learning a plurality of corrective rules for the training confusion sets, the 
step of learning using the allowable transformations and objective function. 
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11. 



The method of claim 10, further comprising the steps of: 
determining a real-time confusion set; and 

applying at least one of the corrective rules to the real-time confusion set. 



12. The method of claim 10, wherein the real-time confusion set comprises a 

plurality of candidate words, and wherein the step of applying further comprises: 
determining a real-time confusion set; 

determining if a candidate word has a posterior probability that is greater 
1 0 than a predetermined threshold; 

applying at least one of the corrective rules to the real-time confusion set 
when the candidate word has a posterior probability that is not greater than a 
predetermined threshold, wherein the step of applying selects one of the candidate words 
in the confusion set; 

15 selecting the candidate word having the posterior probability that is greater 

than a predetermined threshold when the candidate word has a posterior probability that is 
greater than a predetermined threshold; and 

outputting the selected candidate word. 

20 13. The method of claim 10, wherein the step of learning further comprises the 

steps of: 

extracting a plurality of features from each of the training confusion sets, 
each of the features mathematically describing a characteristic of an associated one of the 
confusion sets. 



25 



I4 t The method of claim 13, wherein the features comprise one or more of the 

following: 
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word identity of at least one candidate word in a training confusion set; 
duration of at least one candidate word in a training confusion set; 
posterior probability of at least one candidate word in a training confusion 

set; 

5 difference in posterior probabilities of two candidate words in a training 

confusion set; 

temporal position of a training confusion set in a sentence comprising a 
plurality of training confusion sets; and 

number of candidate words in a training confusion set. 

10 

15. The method of claim 13, wherein each of the allowable transformations 
comprises a template rule, wherein the step of learning further comprises the step of: 

instantiating a plurality of template rules, each of the template rules having 
a form wherein one candidate word of a confusion set is selected if at least one 
15 predetermined criterion is met, each criterion comprising a selected feature, an operation 
and a threshold value for the selected feature. 

16. The method of claim 15, further comprising the steps of: 
providing a baseline predictor; 

20 selecting a training confusion set; 

determining which of the template rules are applicable to the selected 
training confusion set, whereby each applicable rule is a candidate rule; 

determining values of the features in each of the at least one predetermined 
criteria, the values determined from the selected training confusion set; 
25 scoring each of the candidate rules by using the objective function; 

selecting a highest scoring candidate rule; 

applying the highest scoring candidate rule to the baseline predictor to 
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create a modified consensus hypothesis; and 

selecting the highest scoring candidate rule as a corrective rule. 

17. The method of claim 10, wherein the method further comprise the step of 
providing a truth, the truth indicating a correct word for each of the training confusion 
sets, wherein the step of scoring comprises the steps of: 

selecting a candidate rule; 

for each of the training confusion sets, performing the following steps: 

determining if the selected candidate rule chooses or does 

not choose a correct word, as determined from the truth, from the 

candidate words in the corresponding one of the confusion sets; 

providing a value determined from a number of incorrect 

words selected subtracted by a number of correct words selected. 

18. A system for correcting errors for consensus decoding of speech, 
comprising: 

a memory that stores computer-readable code; and 

a processor operatively coupled to said memory, said processor configured 
to implement said computer-readable code, said computer-readable code configured to: 

create a confusion set comprising a plurality of candidate words, each of 
said candidate words having an associated score; and 

select a candidate word from the confusion set, wherein a candidate word 
other than a candidate word having a highest associated score is selected when one or 
more criteria are met. 



19. A system for determining a plurality of corrective rules from a plurality of 

training confusion sets, comprising: 

YOR920010337US1 -19- 



a memory that stores computer-readable code; and 
a processor operatively coupled to said memory, said processor configured 
to implement said computer-readable code, said computer-readable code configured to: 

determine a plurality of allowable transformations on the training 

5 confusion sets; 

provide an objective function; and 

learn a plurality of corrective rules for the training confusion sets, the step 
of learning using the allowable transformations and objective function. 

10 20. An article of manufacture for correcting errors for consensus decoding of 

speech, comprising: 

a computer readable medium having computer readable code means 
embodied thereon, said computer readable program code means comprising: 

a step to create a confusion set comprising a plurality of candidate words, 
15 each of said candidate words having an associated score; and 

a step to select a candidate word from the confusion set, wherein a 
candidate word other than a candidate word having a highest associated score is selected 
when one or more criteria are met. 

20 21. An article of manufacture for determining a plurality of corrective rules 

from a plurality of training confusion sets, comprising: 

a computer readable medium having computer readable code means 
embodied thereon, said computer readable program code means comprising: 

a step to determine a plurality of allowable transformations on the training 

25 confusion sets; 

a step to provide an objective function; and 

a step to learn a plurality of corrective rules for the training confusion sets, 
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the step of learning using the allowable transformations and objective function. 

22. A system for correcting errors for consensus decoding of speech, 
comprising: 

means for creating a confusion set comprising a plurality of candidate 
words, each of said candidate words having an associated score; and 

means for selecting a candidate word from the confusion set, wherein a 
candidate word other than a candidate word having a highest associated score is selected 
when one or more criteria are met. 

23. A system for determining a plurality of corrective rules from a plurality of 
training confusion sets, comprising: 

means for determining a plurality of allowable transformations on the 

training confusion sets; 

means for providing an objective function; and 

means for learning a plurality of corrective rules for the training confusion 
sets, the step of learning using the allowable transformations and objective function. 
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